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Abstract. 

Distorted sums of models were introduced and discussed in [Sh2]. This notion generalizes the 
notion of disjoint (or direct) sums of models by letting the summands overlap. 

In the first section we investigate types in distorted sums and show that the type of a sequence 
of elements a is determined by the 'local' type of a (i.e. the type restricted to a neighborhood of a). 
We simplify the proofs in [Sh] and improve the bounds on the radii needed to determine the types. 

Natural examples of distorted sums are models with distant functions. In the second and third 
sections we discuss such models and improve a theorem by Gaifman ([Ga]), that states that each 
formula is equivalent to a boolean combination of local formulas. 
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1. Distorted sums of models. 



In this section we define distorted sums of models and prove the distorted sum lemma stating 
that types of elements are decomposed into local types and "locational" types in proper expansions. 

Context: All languages are finite and relational. 

a denotes a finite sequence (a , . . . , ft£_i) and lg(a) its length. We will freely write a <E a and 
a C M as if a is a set. a A (b) and a A b denote (a , . . . , ag-i, b). 
M and I denote models and M, I their universe. 

The following definitions are as in [Shi] §2. 

Definition 1.1. Let M. be a r- model and a C M. 

1. Th(.M; a) is the set {^(x) : ip G £ r , A1 |= ^(a)}. 

2. The n-type of a in M or the n-theory of (M; a), Th n (M; a) is defined by induction on n: 

Th°(M;a) is the set {ip(x) : M \= ip(a), tp quantifier free}; 
Th n+1 (M;a) is the set {Th n (M;a A b) : b G M}. 

3. is the set {Th n (M; a) : M a r-model, a C M, lg(o) = £}. (It is finite when r is). 

4. When a % M, Th"(X;a) is Th"(X;a n M) (and we understand it as including the 
information which members of a belong to M). 

Clearly for every formula <p(x) there is some n G N such that for every M and a C M, we can 
determine whether M \= ip(a) from Th n (M; a). 

Definition 1.2. We say that ip(x) G £ T is of depth n and write dp(iys) = n if n is minimal 
with the above property. 

Now dp(<p(x)) — iff ip(x) is quantifier free, and dp((p(x)) = n + 1 iff </?(5) is (equivalent 
to a formula) of the form /\ ieI ±(3y)[il>i(x, y)] where for i G I dp(^) < n and for some i G J 
dp(V'io) — n - (i^ means ^ or -i^). 

We also have for every t G T T nl some formula tp(x) such that dp((p) = n and for every M and 
a C M, h ^(o) ^ Th"(A4;a) = t. (See [Shi] lemma 2.10). 

A distorted sum of models is a model whose universe is a distorted union: 

Notation 1.3. Let (I,d) be a metric space and {A t : t G 1} be pairwise disjoint. For t £ I 
let M t := U{A S : s) < 1}. 

M = uf eJ M t (= UteiA) is called the distorted union of the M t 's. 

For a G A define /i(o) to be the unique t G / such that a G A t . /i(a) is (h(a ), . . . , /i(a r _i)). 
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Conventions, (i) From now on (7, d) will be a metric space and 1 a model with universe 7. 
M is U t d eJ M t and h: M I as above. 

(m) a, 6, c, <i denote elements in M and s, t, u, v elements in 7, usually h(a) — s and h(b) = t. 
(Hi) When Mfl/^we stipulate = t for f G M n 7. 

(iv) d(a, b) for a,b e M is d(h(a), h(b)), d(a, t) for a 6 M and t £ I is d(h(a),t). 

(v) d(i, s) < k means d(tj, s) < k for some ij G i. d(i, s) > k means -<d(i, s) < k. 

Definition 1.4 (Distorted Sum of Models). Let X be a To-model and d a metric on 7. Let, 
for t £ I, Mt be a n -model. We say that a r 2 -model M is the d-distorted F-sum of {Alt '■ t £ 1} 
and write X = J2tei if 

(a) M is a disjoint union of some {A t : t £ 7}, 

(b) M t = U{A S : s £ I, d(t, s) < 1} for each t £ 7, 

(c) if b = (b Q , 6 r _i> C M, then 

Th°(>[; 6) - ^(Th ^; /.(&)),. . . ,Th° (M h{be y,b) , . . .) . 

Remark. Assuming s i= t ^> d(s, t) > 1 we see that a direct sum of models is an instance of 
the above definition. 

We define now some neighborhoods of finite sequences of elements and the notion of an n- 
componcnt. These will be needed in formulating the distorted sum lemma. 

Definition 1.5. (i) Let J C 7 (usually finite). Define a binary relation Ej on J by 

Ej(t,s) (3-fio, . . . ,Uk £ J)(uo = t k Uk = s & (i < k => d(ui,u i+ i) < 2"). 

(ii) Call J' C J an (n, J)-component if [s, i e J'] => 7^j(t, s). 
(Hi) We say that J is an n-component if [s, i G J] =>■ Ej(t, s). 
(iv) (ao, • • • , a£_i) is an ri-component if (h(ao), ■ ■ ■ , h(ai-\)) is. 

(u) Suppose (oo, • • • , fl^-i) C M, and /i(oj) = s, for i < L Define M"(a) C 7 U M by 
M"(a) :={t£l: (3i < £)[d( Si ,t) < 2"]} u(J{M t : (3i < £)[d( Si ,t) < 2"]}. 
we shall not distinguish between M n (d), M n (s) and even M n (a 0} . . . , a e _ 2 , s^-i)- 

Facts 1.6. (i) Ej is an equivalence relation on J, hence when J is finite there is a unique (up 
to permutations) partition J = J° U J 1 . . . U J r_1 into n-components, with r minimal. 
(ii) E 1 }- 1 refines EJ. 
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Proof. Clear. 

We will call a decomposition as in (i) above a minimal decomposition into n-components. 

Notation. For n, £ £ N define a(n, £) by recursion: a(0, £) = and a(n, £) = a(n — 1,^+1) + 
£ + 2 for n > 0. 

Definition 1.7. Let n,/6N and a = (a , . . . ,a*_i) C M. The models A4"(a) and are 
defined as follows: 

(A) A4 n (a) is the model with universe M n (a) and with the signature r" consisting of the following 
relations: 

(i) restrictions of the relations of M and X i.e. for every i? £ n we let 
Qf ' n = {(co, • • • , Cfe_i) C M n (a) H M : A4 \= R(c)}, 
and for every R £ t we let 

Q*- n = «to,...,t fc -i> cM"(o)n/: ihfi(f)}; 

(m) distances i.e. for each k < 2™ 

D fc <" = {(an, a*) : zi.a^ e M n (a), d{ Xl ,x 2 ) < k}, 
(Hi) the graph of h i.e. 

H = {(c,u) :c,u£ M n (a), h(c) = u}; 
(iv) unary predicates expressing x £ M and x £ I. 
M n (b) is equal to M n (a) when h(a) = h(b). M n (s) is M n (a) when h(a) = s. 

(B) T n ' e is the model with universe / and with signature Tq' ( defined by induction on n. Tq' £ is To 
and for n > 1 Tq'* is and the following relations: 

(i) distorted Feferman-Vaught predicates k e for k < i where R\ k i(uo, ■ • ■ j u^-i, u&) holds 
iff (uo, ■ • • , Uk) is an n-component and there exist cq, . ■ . , Ck-i in M such that h(ci) = Ui for i < k 
and 

Th»("- y+1 »(M"(«); Co , , . . . , <%_!,«*) = t ; 
(m) distances i.e. for fc < 2™ 

D fe = {(s,t) e J 2 : d(*,t) < fc}. 

The next facts are used in the proof of the distorted sum lemma. 
Fact 1.8. Let n, m,£,r £ N. 

(i) Let x = (x , x r -i) C MUl. From t := Th n+1 (M m+1 (x); x) one can effectively compute 
Th n (M m+1 (x);x), Th n+1 (M m (x);x) and Th n+1 (M m+1 (x');x') for every x' C x. Moreover, from 
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t one can compute a minimal decomposition x°, . . . , x k 1 of x into m-components and the theories 
Th n+1 (M m (x l );x l ) for i < k. 

(ii) Let s — (so, ■ ■ ■ , s r -i) C I. From s := Th" +1 (I m+1,£ ; s) one can effectively compute 
Th"(Z m+1 ' £ ; s), Th n+1 (Z m ' £+1 ; s) and the set {Th"(Z m+1 ' £ ; s A t) : pr(i)} where pr(i) is a statement 
of the form d{s, t) < k or d(s, t) > k for k < 2 m+1 . 

Proof. Easy. The main facts that are used are that distances < 2 m+1 are given by formulas 
of depth and that t™ +1 and Tq 1+1 ' £ are richer than t{™ and r^ e+1 respectively. 

The distorted sum lemma states that for every a C M of length £, the n-theory of a in M. can 
be computed from the local theories of the n-components of a and the theory of h(a) in I n ' 1 . 

Lemma 1.9 (The Distorted Sum Lemma). Let M = J2tei -M-t be a distorted sum of 
models. For every n, I G N there is a function F n ^ such that if a = (do, ■ ■ ■ , a^-i) C M, and 
a , . . . , W' 1 is a minimal decomposition of a into n-components then 

Th n (M; a) = F„,,(Th"(J"^; h(a)), . . . , Th n {M a ^ l \a% a% . . . ). <r . 

Proof. We will prove the existence of F n ^ by induction on n and for every I. 
F 0i £ is F from the definition of distorted sums. 

Let a = {a , . . . ,ae-i) C M and a°,...,a r ~ 1 be a minimal decomposition into (n + 1)- 
componcnts. Suppose now that F n j + i exists, denote h(a,i) = Si. Now we would like to compute 
t = Th n+1 (M; a) from the data 

Th" +1 (r l+M ; a),..., Th a(n+1 - e \M n+1 (a l );a l ), ...(*< r). 

Represent t = {Th"(X; a A fc) : 6 e M} = T : UT 2 where 

Ti = |Th™(X;a A fe) : 6 E M , 6 ^ U i<r M n (a < )} 

and 

T 2 = {Th"(X; a A fe) : b e M, fee U 4<r M"(a 1 )} 
and let's compute Ti and T2 separately. 

Ti: When b g U i<r M n (a l ) 1 d(a,b) > 2™. Therefore a minimal decomposition of a A 6 into 
n-components refines a , . . . , a r_1 , (6). Let ((6), . . . , a 4J , . . . ) i<r . <fc . be a minimal n-decomposition 
of a A 6. Denote = i. By the induction hypothesis we have 

Th n (M;a A b) = 

= F nA+1 (Th n (l n < t+1 -,s A t),Th a(n < e+1) (M n (b); b), . . . , Th Q( ™< £+1) (AT(a^); a**), . . .), 
therefore for computing T\ it is enough to compute the set of possibilities 
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T\* := {(Th™(J"^ + ^s A t),Th Q( "^ +1 HA^ Il (&);fo),...,Th Q( "' £+1) (A^"(a 4 ^');a l J),...) : 

d{s,t) >2 n , be h-^t)}. 

Our data includes Th Q(n+M) (M n+1 (a l );a l ) for i < r and by 1.8(a) (as a(n + 1J)> a{n,i+ 1)), 
from each such theory we can compute a minimal refinement into n-componcnts a 4 ' , . . . , a l ' fci_1 and 
the theories Th a( - n ^ +1 \M n (d i ' j );d i ' j ) for every j < h. 

On the other hand Th n+1 (2 n+1 > e ; s) gives us {Th n (T l+1 ' e ; s A t) : d(s,t) > 2"} and from this 
we can compute {Th n (J n ' e+1 ; s A t) : d(s,t) > 2"} (by 1.8(n)). From each Th"(J™ +1 ^; s A t), us- 
ing the predicates i?£ +lj0 . £ in Tq +m we can compute the set {Th a{n+1 ' e) (M n (t); t) : t G 1} 
hence (as the graph of h is part of rf) the set {Th a ( n+M > -1 (.M n (t);i,&) : i € J, h(t) = b} 
and hence {Th a{n+1 ' e) - 1 (M n (t);b) : = &}. As a(n + 1,1) - 1 > a(n,t + 1) we can get 

{Th a{n ' i+1 \M n (t); b) : h(t) = b}. Together we get T\*. 

T 2 : Let b E M and /i(b) = t. Suppose that for some k < £ we have d(tk,s) < 2" and that 
at e a* then a minimal n-decomposition of a A b refines a , . . . , a 1 A b, . . . , a*" -1 . 

Let's assume for simplicity that d° A d 1 , a 2 A . . . A a r ~ 1 is a minimal (n + l)-dccomposition of a 
and that a , a 1 , a 2 , . . . , a r ~ 1 is a minimal n-decomposition of a. We will show that we are able to 
compute 

T:= {(Th n (:T^ +1 ;s A *), Th Q( ™< £+1) {M n (d° A b); d° A b), Th Q( ™< £+1) (^"(a 1 ); a 1 ), . . .} : 

a A 6, a 1 , ... is a minimal n-decomposition of a A 6}. 

Clearly computing T2 can be reduced to computing a set like T (and then using F n j+i that exists 
by the induction hypothesis). 

Now Th n+1 (T l+1 ' e ; s) gives us the set 

{Th n (2 n+1 ' e ; s A t) : s° A t, s 1 , s 2 , ... is a minimal n-decomposition of s A t} 

for each f as above, the theories Th a{n/+1) (M n (d t );d I ) for < i < r are fixed and computable 
from Th a(n+1 ' e \M n+1 (a° A a 1 ); d° A a 1 ) and Th Q(n+M) (A4 rl+1 (a 2 A a 3 . . .); a 2 A a 3 . . .) which are part 
of the data, so we only have to compute 

{(Th n (T l ' e+1 ;s A t),Th a(n ' l+1 \M n (a 0A by,a ()A b)) : fee h-\t)}. 

Therefore, given f it is enough to compute the set {Th a{n ' e+1) (M n (a A b); a° A b) : b G /i _1 (t)} from 
s = Th"(J™ +1 ^; s A t). (All the possible s's as well as s" := Th n (2 n,/+1 ; s A t) are given by the datum 

Th n+1(2»+M;a)). 

Now Th Q( "^ +1) (X™(a° A 6); a A 6) belongs to this set if and only if there exists t* G M™ +1 (a°) 
such that h(t*) = b, d(s a ,t*) < 2" and for every c° C M n (d°) with /i(c°) = h(a°), 

t = Th o W +1 )(F( s - 0A f);^f) iff (=i^ +M (5°,t). 



6 



This can certainly be computed from Th a( ™ +1 ' £) (X" +1 (s°); a ) as lg(s°) < I and a(n + 1,£) = 
a(n, I + 1) + t + 2. So we are done. 



Remark. Comparing with [Sh2] §3 and §5, one sees that the size of the neighborhoods taken 
around a sequence a C M is improved to be a power of 2 (< 2 n+lg ( a )) rather than a power of 3. 

One instance in which the distorted sum lemma is of no use is when M=I and for t € I 
A t = {t}. In this case Th n (J n ' e ; h(a)) is stronger then Th n (M.; a) and computing the second theory 
from the first one is pointless. We will formulate an abstract version of the distorted sum lemma 
(in which Th n (2™'^; h(a)) is simplified) and prove in the next sections that the setup can be realized 
and applied in the context of models with distance functions. 

Context. Let (/3(n) :n£N) and (m(n) : n G N) be strictly increasing, satisfying [3{n) > n 
and 2 m <" +1 ) > 3 • 2 m <") (m(n) = 2n will be used later). 

The following definition (and the lemma below) assume that we have fixed (/3(n) : n G N) and 
(m(n) : n € N). 

Definition 1.10. Let n, k G N and s = (so, . . . , st-i) C I. The model I n and the theory 
DTh fc (J";s) are defined as follows: 

(A) 2° is X and for n > 1 X n is the model with universe / and with signature Tq consisting of To 
and distorted unary Feferman-Vaught predicates where Q„(u) holds iff and there exists c G M 
such that h(c) = u and 



(l~ n for n > 1 is 2°). 

So DTh fc (I™;s) describes elements that are distant from s. Note that T n+1 is richer than 1" 
(if f3(n + 1) is big enough), however in general there is no reason to assume that DTh fc (Z";s) is 
computable from DTh fe+1 (J n ; s). 

Notation 1.11. When x = (x , . . ■ , 2^-1} and y are in M U I, we define 



Th^ n \M m{n \u);c)=t. 



(B) DTh°(I";s) is Th°(J";s) and 




Th n (M m (x);y) := (Th"(A4 m (x ); y), . . . , Th n (M m (x e - 1 ); y)). 
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Lemma 1.12 (The Distorted Sum Lemma Abstract Version). Let M = J2tki 
be a distorted sum of models. For every neN there is a function F n such that if a C M, then 

Th n (M;a) = F„(DTh n (X"; h(a)), Th 0{n) (M m(n) (a);a)) 

provided that: for every k <n and for every a and b in M with d(d, b) < 2 m ( fc ~ 1 ), 

<g) n DTh fe_1 (X fe_1 ; h(d) A h(b)) is computable from 

DTh fe (J fe ; h{o)) and Th^" 1 ^^-^^^^. 

Proof. By induction on n. F is given by the definition of distorted sum of models (and we 
may choose (3(0) = m(0) = 0). 

Let a = (ao, . . . , ae-i) be given. Assuming ®„ +1 and the existence of F n we would like to 
compute Th n+1 (M;a) from DTh n+1 (l n+1 ; h(a)) and Th P{n) (M m(n+1) (a);a). 

As before, wc partition {Th n (M; a A b) : b e M } = Th n+l (M; a) into T x U T 2 where 

Ti = {Th n (M;a A b) : b e M, b <g M m ^ n \a)} 

and 

T 2 = {Th n (M;a A b) :beM, be M m( * n \d)} 
and compute T\ and T2 separately. 

If 6 g M m (")(a) then d(a, 6) > 2 m (") hence 

Th /3(n) (M m(n) (a A 6) = Th /5( " ) (A4 m( " ) (a);a) A Th w (M ra(n) (l); 6). 

This sequence is computed from the datum Th /3( ™ +1) (X m( ™ +1) (a); a) and from Th fj(n) (M m{n) (6); b). 
Now DTh™ +1 (T™ +1 ; h(a)) gives us the set of possibilities 

{(DTh n (2 n ;h(a) A t), Th /3( " ) (M ro( " ) (6); 6)) : & G /T 1 ^), d{h{a),t) > 2 m(n) } 

And now we use F„ to compute Ti. 

For computing T 2 it is clearly enough to show how to compute 

T* = {Th n (M;a A b) :beM, be M m ^ n \a Q )}. 

Let b e M m( - n \a ) and i < I be given. If it happens that d(a ,a t ) < 2 • 2 m(n *> then (as 2 m <" +1 ) > 
3 • 2 m <")), M m(n \a t ) C M m (" +1 )(a ). On the other hand, if d(a ,a 4 ) > 2 • 2 m <") then we must have 
d(M») > 2 m(n) i.e. 6 £ M m (")(a») and a* £ M m(n \b). 

Therefore we can compute Th* 3{n) (M m{n) (a l );d A b) from Th /J(n) (A^ m(n+1) (o );a A 6) when a, is 
close to ao, and from Th^ n) (M m(n+1) (a,); a) when di is distant from ao (and we know which of the 
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possibilities holds from Tli (M m( " +1) (ao); a)). Similarly, we can compute Th^ n \M m( - n) (b);a A b) 
from Th /3(n+1) (M rn{n+1) (a );a A 6). 

To sum up, the datum T\J 3[n+1) {M m{n+1) {a);a) gives us the set of possibilities 

{(Th P{n) (M m(n) (d A b);a A b), Th^ (n+1)_1 (M m(n) (a); a A b)> : d(a ,b) < 2 m{n ^} 

and using the datum DTh™ +1 (Z" +1 ; h(a)), the assumption ® n+1 and the existence of F n , we get 

1 2 ■ 



2. Distant elements in metric spaces. 

Here we are concerned with the problem of determining, in a given model, the existence of 
elements that are distant from a tuple a given the local theory of a. From now on it will be more 
convenient to work with a new set of neighborhoods: 

Notation 2.1. let (I,d) be a metric space and a C I, we define 

V k (a) := {tel : d(a,t) < k}. 

So our problem is the following: let (/, d) be a metric space and I be a To model with universe I 
(tq is a finite relational signature) . Suppose that a and b are ^-tuples of elements in / and let C C I 
be a definable set such that there is some c G C with d(a, c) > m. What size of neighborhoods 
V k (a) and V k (b) should we take around a and b in order the ensure that (for sufficiently large n) 

(Th n (V k {d);a) = Th"(V fe (6);6)) (3e e C)[d(b,e) > m] 

(V k (a) is the submodel with universe V k (a)). 

The answer, in the context below, is that diameter 3m suffices. The proof is basically given in 
[Ga], but the bound is slightly improved and shown to be a lower bound. 

Context. 

(1) Distances up to 2m (i.e. "d(x, y) < k" for k < 2m) as well as "c € C" are definable in X. 

(2) V k {a) computes correctly membership in C (e.g. when C is a unary predicate in To) and distances 
up to 2m. 



Lemma 2.2. Let (ao, • • ■ , ae-i), (bo, ■ ■ ■ , bt-i) C / and assume that (for a large enough n) 

(*) Th"(V 3m (a);a) = Th"(V 3m (&); 6) 

and that 1 \= (3c e C)[d(a, c) > m]. 
Then 1 \= (3e £ C)[d(b, e) > m}. 



Proof. First we ask: is there a 'scattered' sequence of length I + 1 i.e. is there a sequence 
(co, . . . , ce) of elements of C such that Ai<j<£ [rf(cj, Cj) > 2m]? If the answer is positive then for each 
i < £ there is at most one j < I such that d(bi, Cj) < m hence there is some c G C with c) > to. 

Otherwise let k < t be maximal such that there is a scattered sequence of length k. Now ask: is 
there such a sequence in V m {a)l If there is one, then, as a scattered sequence of length k can not be 
extended, all members of C are within distance of < 3to from a i.e. in V 3m (d). By the assumption, 
V 3m (d) contains an element c* G C such that d(a,c*) > to. Th(V 3m (a); a) reflects this fact and by 
(*) the same holds in V 3m (b). 

Lastly, suppose there is no scattered sequence of length k inside V m (a). By (*) there isn't 
one in V m (b). Necessarily there is an element of e G C such that d(b,e) > to (because a scattered 
sequence of length k exists somewhere in I) . 

Remark. The depth n of the local theory Th"(V fe (a); a) depends on lg(a), the depth of 
"c G C" and the depth of "dfo y) < 2m". 

Let us show now that 3m is the minimal diameter that is needed to take in order to ensure the 
existence of distant elements. For a graph (G, R) let d(x, y) be the natural distance function defined 
by d(x,y) = iff x = y and (3u , ■ ■ ■ ,u n )[x = u & y = u n & A l<n u t Rii i+ i] => d(x,y) < n). 

Example 2.3. Let (G,R) be an infinite graph. Let C C G be an infinite subset such that for 
every c G C {e G C : d(c, e) = k} is infinite for k = 1, 2 and such that c£(c, e) < 2 for every c, e G C. 
Suppose a, 6 G G\C satisfy: 

(i) d(a, c) < 1 for every c G C except for a unique c* G C that satisfies d(a, c*) = 3, and 

(m) c) < 1 for every c G C. 

When t consists only of the graph relation R and the unary predicate C we have Th(V 2 (a); a) = 
Th(V 2 (6);6) but no element of C has distance > 1 from b. We need to take V 3 (a) and V 3 (b) to 
distinguish between the theories. 



3. Models with a distance function. 

As an application we will consider models with a distance function and improve a theorem by 
Gaifman ([Ga]). 

Context 3.1. Let r be a finite, relational signature. When M is a r- model we can define a 
distance function d between the elements of M as follows: 
d(a,b) = iff a = b, 
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d(a, b) < 1 iff for some predicate R G r (including equality) and some tuple c G M containing 
both a and b, M |= -R(c), 

6) < n + 1 iff for some c G M d(a, c) < n and d(c, 6) < 1, 
d(a, b) = n iff <i(a, b) <n and <i(a, 6) ^ n — 1, 
d(a, 6) = oo iff d(a, b) ^ n for all n. 

If all the relations in r are binary then (i(a, 6) = 1 is expressible by a quantifier free formula 
and (by induction) d(a, b) < 2 n and d(a, b) > 2 n are expressible by formulas of depth n. In general, 
if k is the maximal arity of the relations in r then d(a, b) < 2™ and d(a, b) > 2™ are expressible by 
formulas of depth n + k — 2. 

Representing a model with a distance function as a distorted sum of models: Let 

M be as above, and for b G M let A b = {b} and M b = {a G M : d(b,a) < 1}. Clearly M is 
the distorted union of {Mb : b G M} and as by the definition of d there is a function F such 
that Th°(X;6) = F( . . . , Th (X^ ; 6), . . . ) i<lg(6) - Therefore 7W = T,teM M b, ( thc function /i is 
of course the identity). 

As M = I we are not interested in the main version (1.9) of the distorted sum lemma, however 
the abstract version of the distorted sum lemma is applicable in a slightly modified form. 

Lemma 3.2. Let (fi{n) :neN) and (m(n) : n G N) be strictly increasing, satisfying f) '(n) > n 
and 2" l (" +1 ) > 3 • 2 m{ - n \ Let M be a r-model with a distance function defined as above. 
For every n G N there is a function F n such that if d C M, then 

Th n (M;d) =F„(DTh"(J";a), Th^(V rW (a);a)) 

provided that: for every k <n and for every d and b in M with d(d, b) < 2 m ( fe ~ 1 ) ; 

(g)„ DTh k - 1 (l k - 1 ;d A b) is computable from 

DTh fc (J fc ;a) and T^^'- 1 ^ 2 """ 1 ' (a); a A 6). 

Proof. Similar to the proof of lemma 1.12. The only difference is that we replaced the 
sequence Th^ (n) (A^ ro(n) (o);a) with (the stronger theory) Th p(n) (V 2 ™'"' (a); a). 

Remark. We shall see below that DTh"(J"; a) is in fact determined by Th 13 ^ (V 2 ^ (a); a) 
so it can be eliminated. 

Definition 3.3. <fi(xo, ■ ■ ■ ,x m ^i) is a /c-local formula if it is (equivalent to) a formula where 
all the quantifiers are of the form (3u G V k (x)) and (Vw G V k (x)). We denote /c-local formulas by 
^ k \x). 
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Gaifman's theorem states that in our context all formulas are equivalent to Boolean combina- 
tions of local formulas. 

Theorem 3.4 (Gaifman). Every first order formula a(u , . . . , u m -\) is logically equivalent 
to a Boolean combination of 

(I) Sentences of the form 

(]d„,...,Vi)[A^W & A d(v i ,v j )>2r] 

i<s i<j<s 

where tp^(vi) is obtained from ip^(vj) by substituting vt for Vj. 

(II) Local formulas of the form ip^ l \w) where w C u. 

Moreover, if the quantifier depth of a(u) is n then the following inequalities can be guaranteed: 

r<7™-\ s<m + n, t < -(7" - 1). 
If a is a sentence then only sentences of the form (I) occur. 

We improve of Gaifman's theorem be replacing the exponent of 7 is replaced by an exponent of 

4. 

Basically we are interested in computing the diameter k{n) = 2 m (") that can be used in lemma 
3.2. We show that k(n) can be chosen to be equal to 3 • 4 n_1 . The depths f3(n) are not specified 
and we shall write occasionally Th(. . .) instead of Th' 3 '"^. . .). In general this can be construed as a 
theory that is rich enough to express the relevant distances (this depends of course on the arity of 
the relations in r). 

Fact 3.5. Let a = (a , . . . , a*_i) C M and b E M. 
For every m£N, Th"* (V 3m+1 (a); a A 6) and Th"(V m (6); a A b) determine Th"(V m (a A b); a A &) 
hence (. . . , Th n (V m (ai); a A 6), . . . , Th n (V m (6); a A 6)). 

(Here n* = n*(n,m) is n + the depth needed to express distances up to m). 

Proof. There are 2 cases: When d(o, 6) < 2m+l, V rn {b) C F 3m+1 (a) and Th"(V m (a A 6); a A 6) 
is determined by Th™* (V 3m+1 (a); a A 6). 

When d(a, b) > 2m + 1 for every relation R G r, if V" l (a A fe) |= i?(c) then cither c C T/ m (a) or 
cCV m (b). It follows that Th n (V m (a);a) and Th"(V m (6); 6) determine Th"(V m (a A 6); a A b). 

Definition 3.6. Let k(n) £ N be minimal (if it exists) such that Th(V k ^ n \a);a) determines 
Th n (M; a) for every M and a C M. 
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Fact 3.7. (1) k(n) exists for every neN 
(2) fc(0) = 1 and fc(l) = 3. 

Proof. Gaifman's theorem gives us the existence of fc(n). fc(0) = 1 is trivial. Given 
a C M, Th^Mja) is {Th°(A4; a A 6) : 6 e M}. When d(a,6) < 1 Th°(M;a A b) is computable 
from ThO^a);^). When d(a,b) > 1 then Th°(M;a A b) can be computed from Th°(X;a) and 
Th°(.M;&) where the latter can express only formulas of the form R(b) for R a unary predi- 
cate in r. One needs only to know if there exists a b £ M with d(a, b) > 1 such that M. \= 
[Ri(b) & R 2 (b), • ■ • , (6) & -><9 2 (6) • • •]• Th(V 3 (a); a) can supply this information by Lemma 2.2, 
Th(V 2 (a);a) can't by example 2.3. 

Lemma 3.8. k(n) < 3 • 4™- 1 for n > 1. 

Proof. By induction on n where the case n = 1 is given above. 
As before, we decompose Th n+1 (M;a) = {Th n (M;a A b) : b £ M} into 

Ti := {Th"(X;a A fe) : 6 e M, d(a,6) < fe(n)} 

and 

T 2 := {Th"(X;a A fe) : b £ M, d(a,b) > k{n)} 

and compute them separately. 

Let (3{n + 1) be (3(n) plus the depth required to define distances < 2 m ( n+1 ). 

For computing T\ it is enough, by the induction hypothesis, to find the minimal k £ N such 

that 

Tb.K n+1 '>(V k (a);a) detrmincs {Th^ (n) (V k{n) (a A 6); a A b) : b £ M, d(a,b) < k(n)}. 
Clearly k = 2k(n) is as required. 

For the second part we will use lemma 3.2. Let m(0) = 0, m(l) = log 2 (3) and m(n + 2) — 
m(n + 1) + 2 so DTh"(Z"; o) describes elements of distance > 3 • 4 Il_1 from a. We need this: 

Subclaim. For every n > 1 Th /3( " ) (V 2m( " ) (a); a) determines DTh"(J";a). 

This is proved by induction on n, for every a. The case n = is clear. Assuming the subclaim 
for every k < n, we have to show that DTh n+1 (I" +1 ; a) is determined by Th /3(n+1) (V 2 " 1< " +1> (a); a). 

Now DTh" +1 (J" +1 ;a) is the set of pairs (DTh n (J"; s A t), Th^I"; s A t)) with d(s,t) > 2 m <"). 
To compute it we need to know Th /3(n) (A^ m(n) (a,);a 4 ) for i < lg(a), which is computable from 
Th /3 ^ n+1 ^(V 2 ( + ) (a);d). Next, let us ask, for each possible theory t, if there is b £ M with 
d(a, b) > 2 m (") = 3 • 4"- 1 such that Th p{n) (M m{n) (b); b) = t. Reading the proof of lemma 2.2 we 
see that this is determined by Th^ (rl+1) (y^ m{n) +^ min) n ) = Th /»(n+i)(v 2m(B+1> (a); a) (we have 
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to 'see' not only distant b's but also balls of radius 2 m (") around them, hence the extra 2 m (")). So 
the possibilities for Th°(J"; s A t) arc determined. 

Now by fact 3.5 having T ^("+ 1 )(v 2m( " +1> (a); a) and Th^ (rl) (y 2 "*'"' (6); 6) we can compute 
Th /3( " ) (V 2 " ,< " > (a A 6);a A 6). By the induction hypothesis we get DTh n (J"; a A b). 

Therefore we can compute DTh n (J";a A 6) from Th /3( ™ +1) (y 2? " ( " +1) «) an d Th°(J";a A 6). 
Hence DTh n+1 (T ra+1 ; a) is determined as required. 

Note that as Th /3( " +1) " 1 (V 2 " 1< " +1> (a); a) determines Th PW (V 2mW (a); a) for fc < n+1, it deter- 
mines ; a) as well. 

Returning back to the main proof, we choose k(n + 1) = 4 ■ fc(n). By the subclaim and its proof 
Th /J < n+1 >(V fc(n+1) (a);5) determines the set 

T* = {(Th /3( " ) (V 2 " l< " ) (a A 6);a A 6), DTh n (I"; o A 6)> : d(a, 6) > fc(n)} 

where m(n) is log 2 (fc(n)). To apply lemma 3.2. we only have to check (^) n however the fact 
that Th ,3(r) - 1 (V 2 " ,<r> (a);a A 6) and DTh r (J r ;a) (superfluous) determine DTh r - 1 (2 r - 1 ; a A b) when 
d(a, b) < 2 m< - n ) for r < n follows immediately from the proof of the subclaim. 
Therefore we may conclude that T* determines the set 

T 2 := {Th(M; a A b):beM, d(a, b) > k(n)} 

This finishes the proof. 

Revisiting Gaifman's theorem we get: 

Theorem 3.9. Every first order formula a(uo, ■ ■ ■ , u m -i) is logically equivalent to a Boolean 
combination of 

(I) Sentences of the form 

(3v a ,...,v s ^)[/\^ r \v l ) & f\ d(v i ,v j )>2r] 

i<s i<j<s 

where 4>^ r \vi) is obtained from ip( r \vj) by substituting Vi for Vj. 

(II) Local formulas of the form (3x 6 V 1 (ui))[ip(x , u)] for i < i. 

Moreover, if n = the depth of ct(u) then the following inequalities can be guaranteed: 

r<3-4™-\ s<m + n, t^Z-A 71 ' 1 . 
If a is a sentence then only sentences of the form (I) occur. 
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Let us just remark that (I) comes from asking (as in section 2) if there exists a scattered 
sequence of length m in M. 
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